A fuzzy acoustic-phonetic decoder for speech recognition

نویسندگان

  • Olivier Oppizzi
  • David Fournier
  • Philippe Gilles
  • Henri Meloni
چکیده

In this paper, a general framework of acoustic-phonetic modelling is developed. Context sensitive rules are incorporated into a knowledge-based automatic speech recognition (ASR) system and are assessed with control based on fuzzy decision making. The reliability measure is outlined: a tests collection is run and a confusion matrix is built for each rule. During the recognition procedure the fuzzy set of trained values related to the phonetic unit to be recognized is computed, and its membership function is automatically drawn.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rescoring under fuzzy measures with a multilayer neural network in a rule-based speech recognition system

In this paper, a speech rescoring system is developed on a set of phonetic hypotheses produced by a bottom-up knowledge-based decoder. An original method to automatically compute a fuzzy membership function from top-down acoustic rules statistics is compared with a possibilistic measure. To aggregate the fuzzy degrees into a phonetic score, a mutilayer neural network is trained on the results o...

متن کامل

English-Spanish Bilingual Alphabet for Embedded Speech Recognition

This article introduces the phonetic alphabet that has been used to train acoustic models with a mixture of Spanish language and American English data, with the purpose of improving the speech recognition performance, when using Spanish, for speakers that are fluent in both languages, as is very frequently the case in the USA Spanish speaking population. We target a decoder that can be used in ...

متن کامل

A fuzzy synchronization algorithm for bimodal speech signals

This paper describes a rule{based fuzzy system that estimates the relationship between acoustic and visual speech and uses this estimate for the synchronization of not aligned audio{visual signals. The relations are quantiied by means of a set of rules, which associate typical mouth shapes (visual classes) to speciic acoustic classes. The visual and acoustic classes are learned from training da...

متن کامل

Speech Understanding and Speech Translation in Various Domains by Maximum A-posteriori Semantic Decoding

This paper describes a domain-limited system for speech understanding as well as for speech translation. An integrated semantic decoder directly converts the preprocessed speech signal into its semantic representation by a maximum a-posteriori classification. With the combination of probabilistic knowledge on acoustic, phonetic, syntactic, and semantic levels, the semantic decoder extracts the ...

متن کامل

A Stack Decoder for Continous Speech Recognition

We describe the structure, preliminary implementation and performance of an algorithm for doing continuous speech recognition. The algorithm, known as a stack decoder, proceeds by continually evaluating one-word extensions of the most promising partial transcriptions of an input utterance. The output is a list of candidate complete transcriptions, ordered by likelihood under a stochastic model....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996